2024-05-10 06:48:15
Deep Multi-Task Learning for Malware Image Classification
Ahmed Bensaoud, Jugal Kalita
https://arxiv.org/abs/2405.05906 https://arxiv…
Deep Multi-Task Learning for Malware Image Classification
Ahmed Bensaoud, Jugal Kalita
https://arxiv.org/abs/2405.05906 https://arxiv…
This https://arxiv.org/abs/2404.14955 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization
Michael Kohler, Adam Krzyzak, Alisha S\"anger
https://arxiv.org/abs/2404.07128
End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base
Shuling Li, Yaping Sun, Jinbei Zhang, Kechao Cai, Shuguang Cui, Xiaodong Xu
https://arxiv.org/abs/2405.05738
This https://arxiv.org/abs/2401.12924 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
This https://arxiv.org/abs/2310.05446 has been replaced.
link: https://scholar.google.com/scholar?q=a
Mapping dissolved carbon in space and time: An experimental technique for the measurement of pH and total carbon concentration in density driven convection of CO$_2$ dissolved in water
Hilmar Yngvi Birggison, Yao Xu, Marcel Moura, Eirik Grude Flekk{\o}y, Knut J{\o}rgen M{\aa}l{\o}y
https://arxiv.org/abs/2405.05682
This https://arxiv.org/abs/2402.15784 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
This https://arxiv.org/abs/2309.08395 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification
Judy X Yang, Jun Zhou, Jing Wang, Hui Tian, Wee Chung Liew
https://arxiv.org/abs/2404.03883
Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo
Nayantara Mudur, Carolina Cuesta-Lazaro, Douglas P. Finkbeiner
https://arxiv.org/abs/2405.05255
Bayesian and Convolutional Networks for Hierarchical Morphological Classification of Galaxies
Jonathan Serrano-P\'erez, Raquel D\'iaz Hern\'andez, L. Enrique Sucar
https://arxiv.org/abs/2405.02366
This https://arxiv.org/abs/2203.11155 has been replaced.
link: https://scholar.google.com/scholar?q=a
How Deep Is Your Gaze? Leveraging Distance in Image-Based Gaze Analysis
Maurice Koch, Nelusa Pathmanathan, Daniel Weiskopf, Kuno Kurzhals
https://arxiv.org/abs/2404.18680
This https://arxiv.org/abs/2404.03493 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csNE_…
MedMamba: Vision Mamba for Medical Image Classification
Yubiao Yue, Zhenzhang Li
https://arxiv.org/abs/2403.03849 https://arxiv.org/p…
A consensus-constrained parsimonious Gaussian mixture model for clustering hyperspectral images
Ganesh Babu, Aoife Gowen, Michael Fop, Isobel Claire Gormley
https://arxiv.org/abs/2403.03349
Development and Validation of an Artificial Neural Network for the Recognition of Custom Dataset with YOLOv4
P. Veysi, M. Adeli, N. Peirov Naziri
https://arxiv.org/abs/2405.02298 …
Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism
Trilokesh Ranjan Sarkar, Nilanjan Das, Pralay Sankar Maitra, Bijoy Some, Ritwik Saha, Orijita Adhikary, Bishal Bose, Jaydip Sen
https://arxiv.org/abs/2404.04245<…
Data Augmentation Policy Search for Long-Term Forecasting
Liran Nochumsohn, Omri Azencot
https://arxiv.org/abs/2405.00319 https://arx…
This https://arxiv.org/abs/2404.03493 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csNE_…
Enhancing chest X-ray datasets with privacy-preserving large language models and multi-type annotations: a data-driven approach for improved classification
Ricardo Bigolin Lanfredi, Pritam Mukherjee, Ronald Summers
https://arxiv.org/abs/2403.04024
This https://arxiv.org/abs/2402.16896 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
This https://arxiv.org/abs/2403.01944 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2311.07740 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration
Bo Liu, Grace Li Zhang, Xunzhao Yin, Ulf Schlichtmann, Bing Li
https://arxiv.org/abs/2402.18595
Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification
Saurabh Saini, Kapil Ahuja, Siddartha Chennareddy, Karthik Boddupalli
https://arxiv.org/abs/2405.01600
Data Augmentation Policy Search for Long-Term Forecasting
Liran Nochumsohn, Omri Azencot
https://arxiv.org/abs/2405.00319 https://arx…
On machine learning analysis of atomic force microscopy images for image classification, sample surface recognition
Igor Sokolov
https://arxiv.org/abs/2403.16230
Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models
Hyeonggeun Yun
https://arxiv.org/abs/2404.09828 http…
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Guoping Xu, Xiaxia Wang, Xinglong Wu, Xuesong Leng, Yongchao Xu
https://arxiv.org/abs/2405.01725
This https://arxiv.org/abs/2404.02388 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data
Iqra Bano, Rachmad Vidya Wicaksana Putra, Alberto Marchisio, Muhammad Shafique
https://arxiv.org/abs/2404.03493
This https://arxiv.org/abs/2311.07740 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2404.02388 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Investigating the Robustness of Vision Transformers against Label Noise in Medical Image Classification
Bidur Khanal, Prashant Shrestha, Sanskar Amgain, Bishesh Khanal, Binod Bhattarai, Cristian A. Linte
https://arxiv.org/abs/2402.16734
This https://arxiv.org/abs/2404.02282 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
This https://arxiv.org/abs/2403.03849 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding
Masato Fujitake
https://arxiv.org/abs/2403.14252 https://
This https://arxiv.org/abs/2011.13602 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2404.02282 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
This https://arxiv.org/abs/2311.12481 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csHC_…
This https://arxiv.org/abs/2403.03849 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance
Huakun Shen, Boyue Caroline Hu, Krzysztof Czarnecki, Lina Marsso, Marsha Chechik
https://arxiv.org/abs/2402.19401
This https://arxiv.org/abs/2404.11003 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Remote Sensing Image Enhancement through Spatiotemporal Filtering
Hessah Albanwan
https://arxiv.org/abs/2404.18950 https://arxiv.org/pdf/2404.18950
arXiv:2404.18950v1 Announce Type: new
Abstract: The analysis of time-sequence satellite images is a powerful tool in remote sensing; it is used to explore the statics and dynamics of the surface of the earth. Usually, the quality of multitemporal images is influenced by metrological conditions, high reflectance of surfaces, illumination, and satellite sensor conditions. These negative influences may produce noises and different radiances and appearances between the images, which can affect the applications that process them. Thus, a spatiotemporal bilateral filter has been adopted in this research to enhance the quality of an image before using it in any application. The filter takes advantage of the temporal information provided by multi temporal images and attempts to reduce the differences between them to improve transfer learning used in classification. The classification method used here is support vector machine (SVM). Three experiments were conducted in this research, two were on Landsat 8 images with low-medium resolution, and the third on high-resolution images of Planet satellite. The newly developed filter proved that it can enhance the accuracy of classification using transfer learning by about 5%,15%, and 2% for the three experiments respectively.
This https://arxiv.org/abs/2402.13699 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and Classification
Zhan Shi, Jingwei Zhang, Jun Kong, Fusheng Wang
https://arxiv.org/abs/2403.18134
This https://arxiv.org/abs/2306.12465 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csNE_…
Text Role Classification in Scientific Charts Using Multimodal Transformers
Hye Jin Kim, Nicolas Lell, Ansgar Scherp
https://arxiv.org/abs/2402.14579 https…
Deep Learning Segmentation and Classification of Red Blood Cells Using a Large Multi-Scanner Dataset
Mohamed Elmanna, Ahmed Elsafty, Yomna Ahmed, Muhammad Rushdi, Ahmed Morsy
https://arxiv.org/abs/2403.18468
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
Hao Li, Ying Chen, Yifei Chen, Wenxian Yang, Bowen Ding, Yuchen Han, Liansheng Wang, Rongshan Yu
https://arxiv.org/abs/2402.19326
Automatic classification of prostate MR series type using image content and metadata
Deepa Krishnaswamy, B\'alint Kov\'acs, Stefan Denner, Steve Pieper, David Clunie, Christopher P. Bridge, Tina Kapur, Klaus H. Maier-Hein, Andrey Fedorov
https://arxiv.org/abs/2404.10892
This https://arxiv.org/abs/2306.08538 has been replaced.
link: https://scholar.google.com/scholar?q=a
Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification
Delfina Sol Martinez Pandiani, Nicolas Lazzari, Valentina Presutti
https://arxiv.org/abs/2402.19339
A multi-stage semi-supervised learning for ankle fracture classification on CT images
Hongzhi Liu, Guicheng Li, Jiacheng Nie, Hui Tang, Chunfeng Yang, Qianjin Feng, Hailin Xu, Yang Chen
https://arxiv.org/abs/2403.19983
A multi-stage semi-supervised learning for ankle fracture classification on CT images
Hongzhi Liu, Guicheng Li, Jiacheng Nie, Hui Tang, Chunfeng Yang, Qianjin Feng, Hailin Xu, Yang Chen
https://arxiv.org/abs/2403.19983
This https://arxiv.org/abs/2301.04494 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Breast Cancer Image Classification Method Based on Deep Transfer Learning
Weimin Wang, Min Gao, Mingxuan Xiao, Xu Yan, Yufeng Li
https://arxiv.org/abs/2404.09226
Distilling Datasets Into Less Than One Image
Asaf Shul, Eliahu Horwitz, Yedid Hoshen
https://arxiv.org/abs/2403.12040 https://arxiv.o…
This https://arxiv.org/abs/2402.17187 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2308.04956 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
SPLICE -- Streamlining Digital Pathology Image Processing
Areej Alsaafin, Peyman Nejat, Abubakr Shafique, Jibran Khan, Saghir Alfasly, Ghazal Alabtah, H. R. Tizhoosh
https://arxiv.org/abs/2404.17704 https://arxiv.org/pdf/2404.17704
arXiv:2404.17704v1 Announce Type: new
Abstract: Digital pathology and the integration of artificial intelligence (AI) models have revolutionized histopathology, opening new opportunities. With the increasing availability of Whole Slide Images (WSIs), there's a growing demand for efficient retrieval, processing, and analysis of relevant images from vast biomedical archives. However, processing WSIs presents challenges due to their large size and content complexity. Full computer digestion of WSIs is impractical, and processing all patches individually is prohibitively expensive. In this paper, we propose an unsupervised patching algorithm, Sequential Patching Lattice for Image Classification and Enquiry (SPLICE). This novel approach condenses a histopathology WSI into a compact set of representative patches, forming a "collage" of WSI while minimizing redundancy. SPLICE prioritizes patch quality and uniqueness by sequentially analyzing a WSI and selecting non-redundant representative features. We evaluated SPLICE for search and match applications, demonstrating improved accuracy, reduced computation time, and storage requirements compared to existing state-of-the-art methods. As an unsupervised method, SPLICE effectively reduces storage requirements for representing tissue images by 50%. This reduction enables numerous algorithms in computational pathology to operate much more efficiently, paving the way for accelerated adoption of digital pathology.
This https://arxiv.org/abs/2308.04956 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2304.02621 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
A slice classification neural network for automated classification of axial PET/CT slices from a multi-centric lymphoma dataset
Shadab Ahamed, Yixi Xu, Ingrid Bloise, Joo H. O, Carlos F. Uribe, Rahul Dodhia, Juan L. Ferres, Arman Rahmim
https://arxiv.org/abs/2403.07105
This https://arxiv.org/abs/2308.01381 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Spatial, Temporal, and Geometric Fusion for Remote Sensing Images
Hessah Albanwan
https://arxiv.org/abs/2404.17851 https://arxiv.org/pdf/2404.17851
arXiv:2404.17851v1 Announce Type: new
Abstract: Remote sensing (RS) images are important to monitor and survey earth at varying spatial scales. Continuous observations from various RS sources complement single observations to improve applications. Fusion into single or multiple images provides more informative, accurate, complete, and coherent data. Studies intensively investigated spatial-temporal fusion for specific applications like pan-sharpening and spatial-temporal fusion for time-series analysis. Fusion methods can process different images, modalities, and tasks and are expected to be robust and adaptive to various types of images (e.g., spectral images, classification maps, and elevation maps) and scene complexities. This work presents solutions to improve existing fusion methods that process gridded data and consider their type-specific uncertainties. The contributions include: 1) A spatial-temporal filter that addresses spectral heterogeneity of multitemporal images. 2) 3D iterative spatiotemporal filter that enhances spatiotemporal inconsistencies of classification maps. 3) Adaptive semantic-guided fusion that enhances the accuracy of DSMs and compares them with traditional fusion approaches to show the significance of adaptive methods. 4) A comprehensive analysis of DL stereo matching methods against traditional Census-SGM to obtain detailed knowledge on the accuracy of the DSMs at the stereo matching level. We analyze the overall performance, robustness, and generalization capability, which helps identify the limitations of current DSM generation methods. 5) Based on previous analysis, we develop a novel finetuning strategy to enhance transferability of DL stereo matching methods, hence, the accuracy of DSMs. Our work shows the importance of spatial, temporal, and geometric fusion in enhancing RS applications. It shows that the fusion problem is case-specific and depends on the image type, scene content, and application.
Generalizing deep learning models for medical image classification
Matta Sarah, Lamard Mathieu, Zhang Philippe, Alexandre Le Guilcher, Laurent Borderie, B\'eatrice Cochener, Gwenol\'e Quellec
https://arxiv.org/abs/2403.12167
This https://arxiv.org/abs/2402.06198 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Generalizing deep learning models for medical image classification
Matta Sarah, Lamard Mathieu, Zhang Philippe, Alexandre Le Guilcher, Laurent Borderie, B\'eatrice Cochener, Gwenol\'e Quellec
https://arxiv.org/abs/2403.12167
PE-MVCNet: Multi-view and Cross-modal Fusion Network for Pulmonary Embolism Prediction
Zhaoxin Guo, Zhipeng Wang, Ruiquan Ge, Jianxun Yu, Feiwei Qin, Yuan Tian, Yuqing Peng, Yonghong Li, Changmiao Wang
https://arxiv.org/abs/2402.17187
This https://arxiv.org/abs/2404.06859 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
This https://arxiv.org/abs/2310.06085 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Pneumonia Diagnosis through pixels -- A Deep Learning Model for detection and classification
Amit Karanth Gurpur, Janani S, Ajeetha B, Brintha Therese A, Rajeswaran Rangasami
https://arxiv.org/abs/2404.12405
This https://arxiv.org/abs/2403.03849 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2302.05262 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification
Shuai Li, Xiaoguang Ma, Shancheng Jiang, Lu Meng
https://arxiv.org/abs/2403.06798
Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification
Shuai Li, Xiaoguang Ma, Shancheng Jiang, Lu Meng
https://arxiv.org/abs/2403.06798
Integrating Preprocessing Methods and Convolutional Neural Networks for Effective Tumor Detection in Medical Imaging
Ha Anh Vu
https://arxiv.org/abs/2402.16221
SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification
Yuexi Du, Regina J. Hooley, John Lewin, Nicha C. Dvornek
https://arxiv.org/abs/2403.13148
This https://arxiv.org/abs/2402.01188 has been replaced.
link: https://scholar.google.com/scholar?q=a
Randomized Principal Component Analysis for Hyperspectral Image Classification
Mustafa Ustuner
https://arxiv.org/abs/2403.09117 https://
Learning to Classify New Foods Incrementally Via Compressed Exemplars
Justin Yang, Zhihao Duan, Jiangpeng He, Fengqing Zhu
https://arxiv.org/abs/2404.07507
Shortcut Learning in Medical Image Segmentation
Manxi Lin, Nina Weng, Kamil Mikolaj, Zahra Bashir, Morten Bo S{\o}ndergaard Svendsen, Martin Tolsgaard, Anders Nymark Christensen, Aasa Feragen
https://arxiv.org/abs/2403.06748
Shortcut Learning in Medical Image Segmentation
Manxi Lin, Nina Weng, Kamil Mikolaj, Zahra Bashir, Morten Bo S{\o}ndergaard Svendsen, Martin Tolsgaard, Anders Nymark Christensen, Aasa Feragen
https://arxiv.org/abs/2403.06748
This https://arxiv.org/abs/2302.02108 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
This https://arxiv.org/abs/2303.05789 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Self-supervised learning for classifying paranasal anomalies in the maxillary sinus
Debayan Bhattacharya, Finn Behrendt, Benjamin Tobias Becker, Lennart Maack, Dirk Beyersdorff, Elina Petersen, Marvin Petersen, Bastian Cheng, Dennis Eggert, Christian Betz, Anna Sophie Hoffmann, Alexander Schlaefer
https://arxiv.org/abs/2404.18599 https://arxiv.org/pdf/2404.18599
arXiv:2404.18599v1 Announce Type: new
Abstract: Purpose: Paranasal anomalies, frequently identified in routine radiological screenings, exhibit diverse morphological characteristics. Due to the diversity of anomalies, supervised learning methods require large labelled dataset exhibiting diverse anomaly morphology. Self-supervised learning (SSL) can be used to learn representations from unlabelled data. However, there are no SSL methods designed for the downstream task of classifying paranasal anomalies in the maxillary sinus (MS).
Methods: Our approach uses a 3D Convolutional Autoencoder (CAE) trained in an unsupervised anomaly detection (UAD) framework. Initially, we train the 3D CAE to reduce reconstruction errors when reconstructing normal maxillary sinus (MS) image. Then, this CAE is applied to an unlabelled dataset to generate coarse anomaly locations by creating residual MS images. Following this, a 3D Convolutional Neural Network (CNN) reconstructs these residual images, which forms our SSL task. Lastly, we fine-tune the encoder part of the 3D CNN on a labelled dataset of normal and anomalous MS images.
Results: The proposed SSL technique exhibits superior performance compared to existing generic self-supervised methods, especially in scenarios with limited annotated data. When trained on just 10% of the annotated dataset, our method achieves an Area Under the Precision-Recall Curve (AUPRC) of 0.79 for the downstream classification task. This performance surpasses other methods, with BYOL attaining an AUPRC of 0.75, SimSiam at 0.74, SimCLR at 0.73 and Masked Autoencoding using SparK at 0.75.
Conclusion: A self-supervised learning approach that inherently focuses on localizing paranasal anomalies proves to be advantageous, particularly when the subsequent task involves differentiating normal from anomalous maxillary sinuses. Access our code at https://github.com/mtec-tuhh/self-supervised-paranasal-anomaly
This https://arxiv.org/abs/2403.12167 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example
MingXuan Xiao, Yufeng Li, Xu Yan, Min Gao, Weimin Wang
https://arxiv.org/abs/2404.08279
This https://arxiv.org/abs/2308.13356 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2402.17187 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2404.03883 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2308.13356 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2310.09457 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer
Dominik M\"uller, Philip Meyer, Lukas Rentschler, Robin Manz, Daniel Hieber, Jonas B\"acker, Samantha Cramer, Christoph Wengenmayr, Bruno M\"arkl, Ralf Huss, Frank Kramer, I\~naki Soto-Rey, Johannes Raffler
https://arxiv.or…
DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks
Dominik M\"uller, Philip Meyer, Lukas Rentschler, Robin Manz, Jonas B\"acker, Samantha Cramer, Christoph Wengenmayr, Bruno M\"arkl, Ralf Huss, I\~naki Soto-Rey, Johannes Raffler
https://arxiv.org/abs/2403.16678…
Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification
Yuning Huang, Jingchen Zou, Lanxi Meng, Xin Yue, Qing Zhao, Jianqiang Li, Changwei Song, Gabriel Jimenez, Shaowu Li, Guanghui Fu
https://arxiv.org/abs/2402.07595